NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Event-Driven Force Measurement of a Variable-Stiffness Robotic Finger Using Masked Autoencoder Pre-Training

https://doi.org/10.1109/LRA.2025.3562793

Guo, Qianyu; Lu, Yawen; Fu, Jiaming; Gan, Dongming (June 2025, IEEE Robotics and Automation Letters)

Free, publicly-accessible full text available June 1, 2026
Force-EvT: A Closer Look at Robotic Gripper Force Measurement with Event-Based Vision Transformer

https://doi.org/10.1109/ReMAR61031.2024.10617687

Guo, Qianyu; Yu, Ziqing; Fu, Jiaming; Lu, Yawen; Zweiri, Yahya; Gan, Dongming (June 2024, IEEE)

Full Text Available
Optical Flow as Spatial-Temporal Attention Learners

https://doi.org/10.1109/TPAMI.2024.3463648

Lu, Yawen; Han, Cheng; Wang, Qifan; Fan, Heng; Kong, Zhaodan; Liu, Dongfang; Chen, Yingjie (January 2024, IEEE Transactions on Pattern Analysis and Machine Intelligence)

Full Text Available
From Local to Holistic: Self-supervised Single Image 3D Face Reconstruction Via Multi-level Constraints

https://doi.org/10.1109/IROS47612.2022.9982284

Lu, Yawen; Sarkis, Michel; Bi, Ning; Lu, Guoyu (October 2022, IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS))

Single image 3D face reconstruction with accurate geometric details is a critical and challenging task due to the similar appearance on the face surface and fine details in organs. In this work, we introduce a self-supervised 3D face reconstruction approach from a single image that can recover detailed textures under different camera settings. The proposed network learns high-quality disparity maps from stereo face images during the training stage, while just a single face image is required to generate the 3D model in real applications. To recover fine details of each organ and facial surface, the framework introduces facial landmark spatial consistency to constrain the face recovering learning process in local point level and segmentation scheme on facial organs to constrain the correspondences at the organ level. The face shape and textures will further be refined by establishing holistic constraints based on the varying light illumination and shading information. The proposed learning framework can recover more accurate 3D facial details both quantitatively and qualitatively compared with state-of-the-art 3DMM and geometry-based reconstruction algorithms based on a single image.
more » « less
Full Text Available
Self-supervised Depth Estimation from Spectral Consistency and Novel View Synthesis

https://doi.org/10.1109/IJCNN55064.2022.9891946

Lu, Yawen; Lu, Guoyu (July 2022, IEEE)

Full Text Available
Multi-view Geometry Consistency Network for Facial Micro-Expression Recognition From Various Perspectives

https://doi.org/10.1109/IJCNN55064.2022.9892565

Parikh, Devarth; Lu, Yawen; Kasabov, Nikola; Lu, Guoyu (July 2022, IEEE)
An Unsupervised Approach for Simultaneous Visual Odometry and Single Image Depth Estimation

https://doi.org/10.1109/IJCNN55064.2022.9892294

Lu, Yawen; Lu, Guoyu (January 2022, IEEE International Joint Conference on Neural Network (IJCNN))

Visual odometry (VO) and single image depth estimation are critical for robot vision, 3D reconstruction, and camera pose estimation that can be applied to autonomous driving, map building, augmented reality and many other applications. Various supervised learning models have been proposed to train the VO or single image depth estimation framework for each targeted scene to improve the performance recently. However, little effort has been made to learn these separate tasks together without requiring the collection of a significant number of labels. This paper proposes a novel unsupervised learning approach to simultaneously perceive VO and single image depth estimation. In our framework, either of these tasks can benefit from each other through simultaneously learning these two tasks. We correlate these two tasks by enforcing depth consistency between VO and single image depth estimation. Based on the single image depth estimation, we can resolve the most common and challenging scaling issue of monocular VO. Meanwhile, through training from a sequence of images, VO can enhance the single image depth estimation accuracy. The effectiveness of our proposed method is demonstrated through extensive experiments compared with current state-of-the-art methods on the benchmark datasets.
more » « less
Full Text Available
Bridging the Invisible and Visible World: Translation between RGB and IR Images through Contour Cycle GAN

https://doi.org/10.1109/AVSS52988.2021.9663750

Lu, Yawen; Lu, Guoyu (November 2021, IEEE)

Full Text Available
Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment

https://doi.org/10.1145/3474085.3475689

Wang, Yuxing; Lu, Yawen; Xie, Zhihua; Lu, Guoyu (October 2021, ACM)

Full Text Available
Self-Supervised Single-Image Depth Estimation From Focus and Defocus Clues

https://doi.org/10.1109/LRA.2021.3092258

Lu, Yawen; Milliron, Garrett; Slagter, John; Lu, Guoyu (October 2021, IEEE robotics automation letters)

Self-supervised depth estimation has recently demonstrated promising performance compared to the supervised methods on challenging indoor scenes. However, the majority of efforts mainly focus on exploiting photometric and geometric consistency via forward image warping and backward image warping, based on monocular videos or stereo image pairs. The influence of defocus blur to depth estimation is neglected, resulting in a limited performance for objects and scenes in out of focus. In this work, we propose the first framework for simultaneous depth estimation from a single image and image focal stacks using depth-from-defocus and depth-from-focus algorithms. The proposed network is able to learn optimal depth mapping from the information contained in the blur of a single image, generate a simulated image focal stack and all-in-focus image, and train a depth estimator from an image focal stack. In addition to the validation of our method on both synthetic NYUv2 dataset and real DSLR dataset, we also collect our own dataset using a DSLR camera and further verify on it. Experiments demonstrate that our system surpasses the state-of-the-art supervised depth estimation method over 4% in accuracy and achieves superb performance among the methods without direct supervision on the synthesized NYUv2 dataset, which has been rarely explored.
more » « less
Full Text Available

« Prev Next »

Search for: All records